Automated Classification of Vowel Category and Speaker Type in the High-Frequency Spectrum

Authors

Jeremy J. Donai

Saeid Motiian

Gianfranco Doretto

Abstract

The high-frequency region of vowel signals (above the third formant, F3) has received little research attention. Recent evidence, however, has documented the perceptual utility of high-frequency information in the speech signal above the traditional frequency bandwidth known to contain important cues for speech and speaker recognition. The purpose of this study was to determine whether high-pass filtered vowels could be separated by vowel category and speaker type in a supervised learning framework. Mel frequency cepstral coefficients (MFCCs) were extracted from productions of six vowel categories produced by two male, two female, and two child speakers. Results revealed that the filtered vowels were well separated by vowel category and speaker type using MFCCs from the high-frequency spectrum. These results demonstrate the presence of useful information for automated classification in the high-frequency region. This is the first study to report findings of this nature in a supervised learning framework.
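The pipeline summarized in the abstract (high-frequency MFCC features fed to a supervised classifier) can be illustrated with a minimal sketch. This is not the authors' implementation. It makes several assumptions that are not in the source: synthetic tones stand in for vowel recordings, the high-pass step is approximated by placing the mel filterbank only above a hypothetical 3.5 kHz cutoff, and a nearest-centroid rule stands in for whatever classifier the study used. All function names and parameter values are illustrative.

```python
import cmath
import math
import random


def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def power_spectrum(frame):
    """Naive DFT power spectrum of a Hamming-windowed frame (bins 0..N/2)."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(windowed))
        spec.append(abs(acc) ** 2 / n)
    return spec


def high_band_mfcc(frame, sample_rate, f_low=3500.0, n_filters=12, n_coeffs=6):
    """MFCC-style features from triangular mel filters placed above f_low only.

    f_low, n_filters and n_coeffs are assumed values, not taken from the paper.
    """
    n = len(frame)
    spec = power_spectrum(frame)
    mel_lo, mel_hi = hz_to_mel(f_low), hz_to_mel(sample_rate / 2.0)
    # n_filters + 2 edge frequencies, equally spaced on the mel scale.
    edges = [mel_to_hz(mel_lo + i * (mel_hi - mel_lo) / (n_filters + 1))
             for i in range(n_filters + 2)]
    log_energies = []
    for j in range(1, n_filters + 1):
        lo, mid, hi = edges[j - 1], edges[j], edges[j + 1]
        energy = 0.0
        for k, p in enumerate(spec):
            f = k * sample_rate / n
            if lo < f <= mid:
                energy += p * (f - lo) / (mid - lo)
            elif mid < f < hi:
                energy += p * (hi - f) / (hi - mid)
        log_energies.append(math.log(energy + 1e-12))
    # DCT-II of the log filterbank energies gives the cepstral coefficients.
    return [sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters)
                for j, e in enumerate(log_energies))
            for c in range(n_coeffs)]


def synth_token(peak_hz, rng, sample_rate=16000, n=256):
    """Hypothetical stand-in for a vowel frame: low tone + high peak + noise."""
    return [0.5 * math.sin(2 * math.pi * 500 * i / sample_rate)
            + 0.5 * math.sin(2 * math.pi * peak_hz * i / sample_rate)
            + rng.gauss(0.0, 0.02)
            for i in range(n)]


def centroid(vectors):
    return [sum(col) / len(col) for col in zip(*vectors)]


def classify(features, centroids):
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: math.dist(features, centroids[label]))


rng = random.Random(0)
# Two made-up classes that differ only in where their high-frequency peak sits.
classes = {"class_a": 5000.0, "class_b": 7000.0}
train = {label: [high_band_mfcc(synth_token(hz, rng), 16000) for _ in range(4)]
         for label, hz in classes.items()}
centroids = {label: centroid(vecs) for label, vecs in train.items()}
test_set = [(label, high_band_mfcc(synth_token(hz, rng), 16000))
            for label, hz in classes.items() for _ in range(2)]
accuracy = sum(classify(f, centroids) == label for label, f in test_set) / len(test_set)
print(f"held-out accuracy on synthetic tokens: {accuracy:.2f}")
```

Because the 500 Hz component lies below the assumed cutoff, it contributes nothing to the features, so any separation between the two synthetic classes comes from the high band alone. With real recordings, an established MFCC implementation and an explicit high-pass filter would likely be preferable to this hand-rolled version.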

Similar resources

High Prevalence of CTXM-15 Type Extended-Spectrum Beta-Lactamase Among Clinical Isolates of Klebsiella Pneumoniae

Background: Production of β-lactamases by Enterobacteriaceae, especially Klebsiella pneumoniae, is one of the emerging health problems in the world. The purpose of this study was to assess the frequency of the blaCTX-M15 gene in K. pneumoniae isolates and determine the molecular diversity of CTXM-producing isolates. Methods: In...

Biologically inspired speaker verification

Speaker verification is an active research problem that has been addressed using a variety of different classification techniques. However, in general, methods inspired by the human auditory system tend to show better verification performance than other methods. In this thesis three biologically inspired speaker verification algorithms are presented. The first is a vowel-dependent speaker verif...

Assimilation of Final Low Back Vowel in Eghlidian Dialect

In this article, the low back vowel /A/ in word-final positions in Eghlidian dialect, one of Persian dialects, is studied. This vowel is represented phonetically as [A], [o] and [@] in different phonetic environments. Therefore many words were collected via interviewing ten native speakers so that these different alternant forms can be accounted for appropriately. Since one of the authors of th...

Indexical and linguistic processing by 12-month-olds: Discrimination of speaker, accent and vowel differences

Infants preferentially discriminate between speech tokens that cross native category boundaries prior to acquiring a large receptive vocabulary, implying a major role for unsupervised distributional learning strategies in phoneme acquisition in the first year of life. Multiple sources of between-speaker variability contribute to children's language input and thus complicate the problem of distr...

Discrete Wavelet Transform & Linear Prediction Coding Based Method for Speech Recognition via Neural Network

In the proposed work, the techniques of wavelet transform (WT) and neural network were introduced for speech based text-independent speaker identification and Arabic vowel recognition. The linear prediction coding coefficients (LPCC) of discrete wavelet transform (DWT) upon level 3 features extraction method was developed. Feature vector fed to probabilistic neural networks (PNN) for classifica...

Journal title:

Volume 6, Issue

Pages -

Publication date: 2016
